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BLAST 

Basic Local Alignment Search Tool 

Edit and Resubmit Save Search Strategies Formatting options Download 



SEQ ID N O: 6 (44-343) Compared to M ouse LKB1 

Results for: ] lcl|41549 SEQ ID NO: 6(433aa)_J 

Your BLAST job specified more than one input sequence. This box lets you choose which input sequence to show BLAST results for. 

Query ID 

ICII41549 

lcl|41549 
Description 

SEQ ID NO: 6 
Molecule type 

amino acid 
Query Length 

433 

Subject ID 

gi|7106425|ref|NP_035622.1| 
Description 

serine/threonine-protein kinase 11 [Mus musculus] >gi|81917862|sp|Q9WTK7.1|STK11_MOUSE RecName: Full=Serine/threonine-protein kinase 
11;AltName: FulNSerine/threonine-protein kinase LKB1 >gi|4838565|gb|AAD31044.1|AF145287_1 Peutz-Jeghers syndrome kinase LKB1 [Mus 
musculus] >gi|66491 01 |gb|AAF21 370.1 |AF1 51 71 1_1 protein kinase LKB1 [Mus musculus] >gi|4530575|gb|AAD22100.1| serine/threonine-protein 
kinase LKB1 [Mus musculus] >gi|4589404|dbj|BAA76749.1| LKB1 [Mus musculus] >gi|5901683|gb|AAD55368.1| Peutz-Jeghers syndrome protein 
[Mus musculus] >gi|30851 173|gb|AAH52379.1| Serine/threonine kinase 11 [Mus musculus] >gi|74186437|dbj|BAE42977.1| unnamed protein 
product [Mus musculus] >gi|74192527|dbj|BAE43050.1| unnamed protein product [Mus musculus] >gi|1 1 761 6790|gb|ABK4241 3.1 1 Stk11 [synthetic 
construct] >gi|148699653|gb|EDL31600.1| serine/threonine kinase 11, isoform CRA_a [Mus musculus] 
Molecule type 

Subject Length 



Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST 
and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 

Reference - compositional score matrix adjustment 

Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein 
database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. 

Search Parameters 



Search parameter name Search parameter value 

Program blastp 

Query range 44-343 

Word size 3 

Hitlistsize 100 

Gapcosts 11,1 

Matrix BLOSUM62 

Low Complexity Filter Yes 

Filter string L; 

Genetic Code 1 



Params Ungapped Gapped 



Results Statistics parameter name Results Statistics parameter value 



Effective search space 11 0297 

http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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Graphic Summary 

Distribution of Blast Hits on the Query Sequence 



EL 



An overview of the database sequences aligned to the query sequence is shown. The score of each alignment is indicated by one of five different colors, 
which divides the range of scores into five groups. Multiple alignments on the same database sequence are connected by a striped line. Mousing over a hit 
sequence causes the definition and score to be shown in the window at the top, clicking on a hit sequence takes the user to the associated alignments. 
New: This graphic is an overview of database sequences aligned to the query sequence. Alignments are color-coded by score, within one of five score 
ranges. Multiple alignments on the same database sequence are connected by a dashed line. Mousing over an alignment shows the alignment definition 
and score in the box at the top. Clicking an alignment displays the alignment detail. 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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Dot Matrix 

Plot of lcl|41549 vs gi|7106425|ref|NP_035622.1| I?l 

This dot matrix view shows regions of similarity based upon the BLAST results. The query sequence is represented on the X-axis and the numbers 
represent the bases/residues of the query. The subject is represented on the Y-axis and again the numbers represent the bases/residues of the subject. 
Alignments are shown in the plot as lines. Plus strand and protein matches are slanted from the bottom left to the upper right corner, minus strand 
matches are slanted from the upper left to the lower right. The number of lines shown in the plot is the same as the number of alignments found by BLAST. 



m 



Descriptions 



Alignments 



sp|Q9WTK7.l|STKll_MOUSE RecName: Full =Serine/ threonine -protein kinase 11; AltName: Full=Serine/threonine-pro 

gbl AAD31044 . 1 I AF145287 1 Peutz- Jeghers syndrome kinase LKB1 [Mus musculus] 
9 more sequence titles - 

gb|AAF21370.1|AF151711 1 protein kinase LKB1 [Mus musculus] 
gb[AAD22100.1| serine7threonine-protein kinase LKB1 [Mus musculus) 
dbj |BAA76749. 1 | LKB1 [Mus musculus] 

gb[AAD55368.1l Peutz-Jeghers syndrome protein [Mus musculus] 
gb AAH52379.ll Serine/threonine kinase .11 [Mus musculusl 
dbj |EAE42977 . 1 | unnamed protein product [Mus musculus] 



— , protein product [Mus musculus] 

gb|ABK42413.1| Stkll [synthetic construct] 

gb|EDL31600.l| serine/threonine kinase 11, isoform CRA_a [Mus musculus] 



trix adjust. 
(0%) 

Query 44 KLIGKYLMGDLLGEGSYGKVKEVLDSETLCRRAV PNGEANVKKEIQLLR 103 

KLIGKYLMGDLLGEGSYGKVKEVLDSETLCRRAVKILKKKKLRRIPNGEANVKKEIQLLR 
Sbjct 44 KLIGKYLMGDLLGEGSYGKVKEVLDSETLCRRAVKILKKKKLRR I PNGEANVKKEIQLLR 103 

Query 104 RLRHKNVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYFCQLIDG 163 

RLRH+NVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYF QLIDG 
Sbjct 104 RLRHRNVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHQYFEQLIDG 163 

Query 164 LEYLHSQGIVHKDIKPgnlllr.r.ggtlKISDLGVAEALHPFAADDTCRTSQGSPAFQPPE 223 

LEYLHSQGIVHKDIKPGNLLLTT GTLKISDLGVAEALHPFA DDTCRTSQGSPAFQPPE 
Sbjct 164 LEYLHSQGIVHKDIKPGNLLLTTNGTLKISDLGVAEALHPFAVDDTCRTSQGSPAFQPPE 223 

Sb j ct 224 IANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIGRGDFTIPCDCGPPLS 

Query 284 DLLKGMLEYEPAKRFSIRQIRQHSWFRKKHppaeap\qpiopsuDTKDRWRSMTVVPYLED 
DLL+GMLEYEPAKRFSIRQIRQHSWFRKKHP AEA VPIPPSPDTKDRWRSMTWPYLED 
Sbjct 284 DLLRGMLEYEPAKRFSIRQIRQHSWFRKKHPLAEALVPIPPSPDTKDRWRSMTWPYLED 



http : //blast, ncbi .nlm. nih. gov/B last . cgi 
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BLAST 



Basic Local Alignment Search Tool 



Blast 2 sequences 
STRAD comparison 

Results for: 

|ref|NP_001003787.1 STE20-related kinase adapter protein alpha isoform 1 [Homo sapiens] >gi|74759034|sp|Q7RTN6.1|STRAA_...{431aa)j:J 
Your BLAST job specified more than one input sequence. This box lets you choose which input sequence to show BLAST results for. 

Query ID 

qj|51242955lreflNP 001003787.11 
gi|51242955|ref|NP_001003787.1| 
Description 

STE20-related kinase adapter protein alpha isoform 1 [Homo sapiens] >gi|74759034|sp|Q7RTN6.1|STRAA_HUMAN RecName: 
Full=STE20-related kinase adapter protein alpha; Short=STRAD alpha; AltName: Full=STE20-related adapter protein; AltName: 
Full=Serologically defined breast cancer antigen NY-BR-96 >gi|34494889|tpg|DAA01797.1| TPA_exp: STE20-related adaptor protein 
[Homo sapiens] >gi|119614691|gb|EAW94285.1| protein kinase LYK5, isoform CRA_c [Homo sapiens] 

Molecule type 
amino acid 

Query Length 
431 



Subject ID 

4 subjects 
Description 
Molecule type 

amino acid 
Subject Length 

Pr ° 9 8 BLASTP 2.2.24+ Citation 



Reference 

Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 

Reference - cor '' ' 



Search parameter name Search parameter value 

Program blastp 

Expect value 10 
Hitlistsize 100 
Gapcosts 11,1 



Params Ungapped Gapped 

Lambda 0.318619 0267 



Results Statistics 

Results Statistics parameter name Results Statistics parameter value 

Effective search space ™~ 144800 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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Graphic Summary 



Distribution of 4 Blast Hits on the Query Sequence 



EL 



An overview of the database sequences aligned to the query sequence is shown. The score of each alignment is indicated by one of five 
different colors, which divides the range of scores into five groups. Multiple alignments on the same database sequence are connected by a 
striped line. Mousing over a hit sequence causes the definition and score to be shown in the window at the top, clicking on a hit sequence takes 
the user to the associated alignments. New: This graphic is an overview of database sequences aligned to the query sequence. Alignments are 
color-coded by score, within one of five score ranges. Multiple alignments on the same database sequence are connected by a dashed line. 
Mousing over an alignment shows the alignment definition and score in the box at the top. Clicking an alignment displays the alignment detail. 



Color key for alignment scores 




1 



80 



160 



240 



320 



400 




http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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Descriptions 

Legend for links to other i 



IS UniGene H GEO □ Gene M Structure El Map Viewer 11 PubChem BioAssay 
Sequences producing significant alignments: 



Accession Description 



Max Total Query f 
score score coverage \ 



Links 

e 



PREDICTED: similar 



in kinase LYK5 isoform 2 isoform 1 [Canis familiaris] 7g6 
id kinase adapter protein alpha [Bos taurus] 
>gi|240849233|ref|NP_001 155356.1 1 STE20-related kinase adapter protein alpha 
[Ovisaries] >gi|75070042|sp|Q5E9J9.1|STRAA_BOVIN RecName: Full=STE20- 
related kinase adapter protein alpha; Short=STRAD alpha; AltName: Full=STE20- 
related adapter protein >gi|59858207|gb|AAX08938.1 1 protein kinase LYK5 isoform 
4 [Bos taurus] >gi|238566930|gb|ACR46653.1| STRADA [Ovis aries] 
>gi|296476245|gb|DAA1 8360.1 1 STE20-related kinase adapter protein alpha [Bos 748 
taurus] >ref|NP_001155356.1| STE20-related kinase adapter protein alpha [Ovis 
aries] >sp|Q5E9J9.1|STRAA_BOVIN RecName: Fuil=STE20-related kinase 
adapter protein alpha; Short=STRAD alpha; AltName: Full=STE20-related adapter 
protein >gb|AAX08938.1| protein kinase LYK5 isoform 4 [Bos taurus] 
>gb|ACR46653.1| STRADA [Ovis aries] >gb|DAA18360.1| STE20-related kinase 
adapter protein alpha [Bos taurus] 
STE20-related kinase adapter protein alpha [Mus musculus] 
>gi|12847582|dbj|BAB27626.1 1 unnamed protein product [Mus musculus] 
>gi|33638094|gb|AAQ24157.1| protein kinase LYK5 splice variant 1 [Mus 
musculus] >gi|35192984|gb[AAH58517.1| RIKEN cDNA 2610019A05 gene [Mus 
musculus] >gi|1 1761 6946|gb|ABK42491 .1 1 STLK5 [synthetic construct] 
>gi|123243203|emb|CAM27017.1| novel protein [Mus musculus] 
>gi|148702325|gb|EDL34272.1| RIKEN cDNA 261001 9A05, isoform CRA_b [Mus 
musculus] >gi 1 1 48702326|gb| EDL34273. 1 1 RIKEN cDNA 2610019A05, isoform 775 
CRA_b [Mus musculus] >dbj|BAB27626.1 1 unnamed protein product [Mus 
musculus] >gb|AAQ24157.1| protein kinase LYK5 splice variant 1 [Mus musculus] 
>gb|AAH58517.1| RIKEN cDNA 2610019A05 gene [Mus musculus] 
>gb|ABK42491.1| STLK5 [synthetic construct] >emb|CAM27017.1| novel protein 
[Mus musculus] >gb|EDU34272.1| RIKEN cDNA 2610019A05, isoform CRA b 
[Mus musculus] >gb|EDL34273.1| RIKEN cDNA 2610019A05, isoform CRA^b 
[Mus musculus] 

STE20-related kinase adapter protein alpha [Rattus norvegicus] 
>gi|81912054|sp|Q7TNZ6.1|STRAA_RAT RecName: Fuli=STE20-related kinase 
adapter protein alpha; Short=STRAD alpha; AltName: Full=STE20-related adapter 
protein >gi|3308721 3|gb|AAP92801 .1 1 protein kinase LYK5 [Rattus norvegicus] 758 
>sp|Q7TNZ6.1|STRAA_RAT RecName: Full=STE20-related kinase adapter protein 
alpha; Short-STRAD alpha; AltName: Full=STE20-related adapter protein 
>gb|AAP92801.1| protein kinase UYK5 [Rattus norvegicus] 



EEE3 



BEEl 



. EZ3 



Alignments 

Select All Get selected s€ 



:s Multiple alignment 



Identities 
Query 42 



>ref |XP_850260.1| EISEI PREDICTED: similar to protein kinase LYK5 isoform 2 isoform 1 
[Canis familiaris] 
Length=3 94 

■elated kinase adaptor alpha 

16 bits (2030), Expect = 0.0, Method: Compositional matrix adjust. 
= 376/390 (97%), Positives = 381/390 (98%1 , Gaps = 0/390 (0%) 

TNDASSESIASFSKQEVMSSFLPEGGCYELLTVIGKGFEDLMTWLARYKPTGEYVTVRR 101 
TN+ASSESIAS SKQE+MSSFLPEGG YELLT+ IGKGFEDLMTVNLARYKPTGEYVTVRR 
Sbjct 5 TNEASSESIASLSKQEIMSSFLPEGGRYELLTIIGKGFEDLMTTOLARYKPTGEYVTVRR 64 

Query 102 INLEACSNEMVTFLQGELHVSKLFNHPNIVPYRATFIADNELWWTSFMAYGSAKDLICT 161 

INLEACSNEMVTFLQGELHVSKLF+HPNI + PYRATFIADNELWWTSFMAYGSAKDLICT 
Sbjct 65 INLEACSNEMVTFLQGELHVSKLFSHPNILPYRATFIADNELWWTSFMAYGSAKDLICT 124 

Query 162 HFMDGMNELAIAYILQGVLKALDYIHHMGYVHRSVKASHILISVDGKVYLSGLRSNLSMI 221 

HFMDGMNELAIAYILQGVLKALD i ± I - I I LRSNLSMI 

Sbjct 125 HFMDGMNEIiAIAYILQGVLKALDYIHHMGYVHRSVKASHILISSDGKVYLSGLRSNLSMI 184 

Query 222 SHGQRQRWHDFPKYSVKVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKD 281 

SHGQRQRWHDFPKYSVKVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKD 
Sbjct 185 SHGQRQRWHDFPKYSVKVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKD 244 

Query 282 MPATQMLLEKLNGTVPCLLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRPSNGDSPS 341 

MPATQMLLEKLNGTVPCLLDTSTIPAEELTMS SRS ANSGLSDSL TSTPR SNGDSPS 
Sbjct 245 MPATQMLLEKLNGTVPCLLDTSTIPAEELTMSTSRSAANSGLSDSLATSTPRTSNGDSPS 3 04 

Query 342 HPYHRTFSPHFHHFVEQCLQRNPDARPSASTLLNHSFFKQIKRRASEALPELLRPVTPIT 401 

HPYHRTFSPHFHHFVEQCLQRNPD RPSASTLLNHSFFKQIKRRASEALPELLRPVTPIT 
Sbjct 305 HPYHRTFSPHFHHFVEQCLQRNPDVRPSASTLLNHSFFKQIKRRASEALPELLRPVTPIT 364 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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>ref |NP_001015603.1| EMAJ STE20-related kinase adapter protein alpha [Bos taurus] 
ref |NP_001155356.1| EH STE20-related kinase adapter protein alpha [Ovis aries] 
sp | Q5E9 J9 . 1 | STRAA_BOVIN H RecName : Full=STE20-related kinase adapter protein alpha 

alpha; AltName: Full=STE20-related adapter protein 
gb|AAX0893 8.1| E protein kinase LYK5 isoform 4 [Bos taurus] 
gb|ACR4 6653.1| 0 STRADA [Ovis aries] 

gb |DAA18360. 1 | E STE20-related kinase adapter protein alpha [Bos taurus] 
Length=3 73 

related kinase adaptor alpha [Bos taurus] 

748 bits (1932) , Expect = C 
s = 356/373 (96%) , Positives 

MSSFLPEGGCYELLTVIGKGFEDLMTWIARYKPTGEYVTVRRINLEACSNEMVTFLQGE 118 
MSSFLPEGGCYELLTVIGKGFEDLMTVNLARYKPTGEYVTVRRINLEACSNEMVTFLQGE 
MSSFLPEGGCYELLTVIGKGFEDLMTVNLARYKPTGEYVTVRRINLEACSNEMVTFLQGE 6 0 

LHVSKLFNHPNIVPYRATFIADNELWWTSFMAYGSAKDLICTHFMDGMNELAIAYILQG 178 
LHVSKLF+HPNI + PY ATFIADNELWWTSFMAYGSAKDLICTHFMDGM+ELAIAYILQG 
LHVSKLFSHPNILPYGATFIADNELWVVTSFMAYGSAKDLICTHFMDGMSELAIAYILQG 12 0 

Query 179 VLKALDYIHHMGYVHRSVKASHILISVDGKVYLSGLRSNLSMISHGQRQRWHDFPKYSV 238 

LKALDYIHHMGYVHRSVKASH+LIS DGKVYLSGLRSNLSMISHGQRQRWHDFPKYS + 
Sbjct 121 ALKALDYIHHMGYVHRSVKASHVLISADGKVYLSGLRSNLSMISHGQRQRWHDFPKYSI 180 

Query 239 KVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKDMPATQMLLEKLNGTVPC 298 

KVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKDMPATQMLLEKLNGTVPC 
Sbjct 181 KVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKDMPATQMLLEKLNGTVPC 240 

Query 299 LLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRPSNGDSPSHPYHRTFSPHFHHFVEQ 358 

LLDTSTIPAEELTMS SRS ANSGLS+SL STPR SNGDS PSHPYHRTFS PHFHHFVEQ 
Sbjct 241 LLDTSTIPAEELTMSTSRSAANSGLSESLAPSTPRTSNGDSPSHPYHRTFSPHFHHFVEQ 300 

Query 359 CLQRNPDARPSASTLLNHSFFKQIKRRASEALPELLRPVTPITNFEGSQSQDHSGIFGLV 418 
CLQRNPD RPSASTLLNHSFFKQIKRRASEALPELLRPVTPIT FEG QSQDHSGIFGLV 
CLQRNPDMRPSASTLLNHSFFKQIKRRASEALPELLRPVTPITTFEGRQSQDHSGIFGLV 360 



Sbjct 



TNLEELEVDDWEF 
TNLEELEVDDWEF 



Sbjct 301 
Query 419 
Sbjct 361 

>ref |NP_082402.1| PlEll'll STE20-related kin< 
dbj|BAB27626.1| 13 unnamed protein product 
gb|AAQ24157.l| G3 protein kinase LYK5 spl: 

gb|AAH58517.1| H RIKEN cDNA 2610019A05 gene [Mus musculus] 
gb|ABK42491.1| STLK5 [synthetic construct] 
emb|CAM27017.l| H novel protein [Mus musculus] 

gb|EDL34272.1| B RIKEN cDNA 2610019A05, isoform CRA_b [Mus musculus] 
gb|EDL34273.1| H RIKEN cDNA 2610019A05, isoform CRA_b [Mus musculus] 
Length=3 94 

)- related kinase adaptor alpha [Mus musculus] 



e adapter protein alpha [Mus musculus) 
[Mus musculus] 
e variant 1 [Mus musculus] 



Query 42 TNDASSESIASFSKQEVMSSFLPEGGCYELLTVIGKGFEDLMTVNLARYKPTGEYVTVRR 101 



Query 102 INLEACSNEMVTFLQGELHVSKLFNHPNIVPYRATFIADNELWWTSFMAYGSAKDLICT 161 

INLEACSNEMVTFLQGELH VS KLF+HPNIVP YRATF IADNELWWTS FMAYGSAKDLI T 

Sbjct 65 INLEACSNEMVTFLQGELHVSKLFSHPNIVPYRATF IADNELWWTS FMAYGSAKDLIGT 124 

Query 162 HFMDGMNELAIAYILQGVLKALDYIHHMGYVHRSVKASHILISVDGKVYLSGLRSNLSMI 221 

HFMDGMNELAIAYILQGVLKALDYIHHMGYVHRSVKASHILIS DGKVYLSGLRSNLSMI 

Sbjct 125 HFMDGMNELAIAYILQGVLKALDYIHHMGYVHRSVKASHILISTDGKVYLSGLRSNLSMI 184 

Query 222 SHGQRQRWHDFPKYSVKVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKD 281 

SHGQRQR VHDFPKYS+ KVLPWLS PEVLQQNLQGYDAKSDI YS VG ITACELANGHVPFKD 

Sbjct 185 SHGQRQRAVHDFPKYS I KVLPWLS PEVLQQNLQGYDAKSDIYSVG ITACELANGHVPFKD 244 

Query 282 MPATQMLLEKLNGTVPCLLDTSTI PAEELTMS PSRSVANSGLSDS LTTSTPRPSNGDS PS 341 

MPATQMLLEKLNGTVPCLLDTSTIPAEELTMSPSRS+AN GL+DSL + RPSNGDSPS 

Sbjct 245 MPATQMLLEKLNGTVPCLLDTSTI PAEELTMS PSRS I ANPGLNDSLAAGSLRPSNGDS PS 304 

Query 342 HPYHRTFSPHFHHFVEQCLQRNPDARPSASTLLNHSFFKQIKRRASEALPELLRPVTPIT 401 
HPYHRTFSPHFH+FVEQCLQRNPDARP+ASTLLNHSFFKQIKRRASEALPELLRPVTPIT 

HPYHRTFSPHFHNFVEQCLQRNPDARPNASTLLNHSFFKQIKRRASEALPELLRPVTPIT 364 



Sbjct 305 
Sbjct 



NFEGSQSQDHSGIFGLVTNLEELEVDDWEF 431 
NFEGSQSQDHSGIFGLVTNLE+LEVDDWEF 
NFEGSQSQDHSGIFGLVTNLEDLEVDDWEF 3 94 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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>ref |NP_877972.1| PlMi'il STE20-related kinase adapter protein alpha [Rattus norvegicus] 
sp|Q7TNZ6.1|STEAA_RAT H RecName : Full=STE20-related kinase adapter protein alpha; Short =STRAD 

alpha; AltName: Full=STE20-related adapter protein 
gblAAP92801.ll protein kinase LYK5 [Rattus norvegicus] 

Length=3 93 

GENE ID: 303605 Strada | STE20-related kinase adaptor alpha [Rattus norvegicus] 
(10 or fewer PubMed links) 

Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 362/390 (93%), Positives = 377/390 (97%j , Gaps = 1/390 (0%) 

Query 42 TNDASSESIASFSKQEVMSSFLPEGGCYELLTVIGKGFEDLMTVNLARYKPTGEYVTVRR 101 

N+ASSESIASFSK E+MSSFLPEGGCYELL+VIGKGFEDLMTVN +RYKPTGEYVTVRR 
Sbjct 5 ANEASSESIASFSKPEIMSSFLPEGGCYELLSVIGKGFEDLMTVN-SRYKPTGEYVTVRR 63 

Query 102 INLEACSNEWTFLQGELHVSKLFNHPNIVPYRATFIADNELWWTSFMAYGSAKDLICT 161 

INLEACSIiC T T r r I h VSKLF+HPNIVPYRATFIADNELW VTSFMAYGSAKDLI T 
Sbjct 64 INLEACSNEMVTFLQGELHVSKLFSHPNIVPYRATFIADNELWAVTSFMAYGSAKDLIGT 123 

Query 162 HFMDGMNELAIAYILQGVLKALDYIHHMGYVHRSVKASHILISVDGKVYLSGLRSNLSMI 221 

HFMDGM+ELAIAYILQGVLKALDYIHHMGYVHRSVKASHILIS DGKVYLSGLRSNLSMI 
Sbjct 124 HFMDGMSELAIAYILQGVLKALDYIHHMGYVHRSVKASHILISTDGKVYLSGLRSNLSMI 183 

Query 222 SHGQRQRWHDFPKYSVKVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHVPFKD 281 

SHGQRQR VHDFPKYS +KVLPWLS PEVLQQNLQG YDAKSD I YS VG ITACELANGHVPFKD 
Sbjct 184 SHGQRQRA VHDFPKYSIKVLPWLSPEVLQQNLQGYDAKSD I YSVG ITACELANGHVPFKD 243 

Query 282 MPATQMLLEKLNGTVPCLLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRPSNGDSPS 341 

MPATQMLLEKLNGTVPCLLDTSTIPAEELTMS PSRS + AN GL+DSL + RP+NGDSPS 
Sbjct 244 MPATQMLLEKLNGTVPCLLDTSTIPAEELTMSPSRSIANPGLNDSLAAGSLRPANGDSPS 3 03 

Query 342 HPYHRTFSPHFHHFVEQCLQRNPDARPSASTLLNHSFFKQIKRRASEALPELLRPVTPIT 401 

HPYHRTFSPHFH+FVEQCLQRNPDARP+ASTLLNHSFFKQIKRRASEALPELLRPVTPIT 
Sbjct 304 HPYHRTFSPHFHNFVEQCLQRNPDARPNASTLLNHSFFKQIKRRASEALPELLRPVTPIT 363 

Query 402 NFEGSQSQDHSGIFGLVTNLEELEvDDWEF 431 

+FEGSQSQDHSGI GLVTNLE+LEVDDWEF 
Sbjct 364 SFEGSQSQDHSGILGLVTNLEDLEVDDWEF 393 



Select All G et selected sequences Distance tree of results Wtifijea'i griment 
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vs. Schizosaccharomyces pombe 


pmo25 


51.7 


53.8 


0.718 


0 


0 


Blast 


vs. Arabidopsis thaliana 


AT4G17270 


44.7 


52.1 


0.764 


0 


0 


Blast 


vs. Arabidopsis thaliana 


AT5G47540 


43.5 


52.5 


0.753 


0 


0 


Blast 


vs. Oryza sativa 


Os07g0585100 


47.0 
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vs. Danio rerio 
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Blast 
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Blast 
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Blast 


vs. Pan troglodytes 
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vs. Danio rerio 
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65.4 
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0 
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vs. Anopheles gambiae 
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0 


Blast 


vs. Rattus norvegicus 


Cab39 


51.7 


53.7 


0.720 


0 


0 


Blast 


vs. Danio rerio 


cab39 


51.7 


52.5 


0.753 


0 


0 


Blast 


vs. Drosophila melanogaster 


Mo25 


52.1 


51.1 


0.791 


0 


0 


Blast 


vs. Anopheles gambiae 


AgaP_AGAP000812 


50.9 


50.1 


0.821 


0 


0 


Blast 


vs. Anopheles gambiae 


AgaP_AGAP0 11060 


50.6 


50.0 


0.824 


0 


0 


Blast 


vs. Caenorhabditis elegans 


mop-25.1 


51.7 


53.3 


0.731 


0 


0 


Blast 


vs. Caenorhabditis elegans 


mop-25.2 


48.8 


52.0 


0.765 


0 


0 


Blast 


vs. Arabidopsis thaliana 


AT4G17270 


39.9 


51.0 


0.794 


0 


0 


Blast 


vs. Arabidopsis thaliana 


AT5G47540 


40.4 


50.3 


0.816 


0 


0 


Blast 


vs. Oryza sativa 


Os07g0585100 


43.0 


51.4 


0.782 


0 


0 


Blast 


Arabidopsis thaliana 


AT4G17270 














vs. Homo sapiens 


CAB39 


44.7 


52.4 


0.756 


0 


0 


Blast 


vs. Pan troglodytes 


CAB39 


44.7 


52.4 


0.756 


0 


0 


Blast 


vs. Canis lupus familiaris 


CAB39 


44.7 


52.1 


0.764 


0 


0 


Blast 


vs. Bos taurus 


CAB39 


44.7 


52.1 


0.764 


0 


0 


Blast 


vs. Mus musculus 


Cab39 


45.0 


53.1 


0.737 


0 


0 


Blast 


vs. Rattus norvegicus 


Cab39 


44.7 


52.2 


0.761 


0 


0 


Blast 


vs. Danio rerio 


cab39 


45.3 


53.2 


0.734 


0 


0 


Blast 


vs. Drosophila melanogaster 


Mo25 


42.4 


51.3 


0.785 


0 


0 


Blast 


vs. Anopheles gambiae 


AgaP_AGAP000812 


43.1 


51.1 


0.793 


0 


0 


Blast 


vs. Anopheles gambiae 


AgaP_AGAP0 11060 


43.1 


50.6 


0.807 


0 


0 


Blast 


vs. Caenorhabditis elegans 


mop-25.1 


45.5 


50.8 


0.800 


0 


0 


Blast 


vs. Caenorhabditis elegans 


mop-25.2 


42.9 


49.5 


0.839 


0 


0 


Blast 


vs. Schizosaccharomyces pombe 


pmo25 


39.9 


51.0 


0.794 


0 


0 


Blast 


vs. Arabidopsis thaliana 


AT5G47540 


87.2 


85.4 


0.162 


0 


0 


Blast 


vs. Oryza sativa 


Os07g0585100 


70.8 


70.5 


0.374 


0 


0 


Blast 


Arabidopsis thaliana 


AT5G47540 














vs. Homo sapiens 


CAB39 


43.5 


52.5 


0.753 


0 


0 


Blast 


vs. Pan troglodytes 


CAB39 


43.5 


52.5 


0.753 


0 


0 


Blast 


vs. Canis lupus familiaris 


CAB39 


43.5 


52.1 


0.764 


0 


0 


Blast 


vs. Bos taurus 


CAB39 


43.5 


52.5 


0.753 


0 


0 


Blast 


vs. Mus musculus 


Cab39 


43.5 


51.7 


0.775 


0 


0 


Blast 


vs. Rattus norvegicus 


Cab39 


43.5 


51.5 


0.781 


0 


0 


Blast 


vs. Danio rerio 


cab39 


43.8 


52.1 


0.764 


0 


0 . 


Blast 


vs. Drosophila melanogaster 


Mo25 


42.1 


50.2 


0.818 


0 


0 


Blast 


vs. Anopheles gambiae 


AgaP_AGAP000812 


43.3 


49.8 


0.830 


0 


0 


Blast 


vs. Anopheles gambiae 


AgaP_AGAP011060 


42.9 


48.5 


0.869 


0 


0 


Blast 


vs. Caenorhabditis elegans 


mop-25.1 


46.5 


50.1 


0.822 


0 


0 


Blast 


vs. Caenorhabditis elegans 


mop-25.2 


42.2 


49.9 


0.827 


0 


0 


Blast 


vs. Schizosaccharomyces pombe 


pmo25 


40.4 


50.3 


0.816 


0 


0 


Blast 


vs. Arabidopsis thaliana 


AT4G17270 


87.2 


85.4 


0.162 


0 


0 


Blast 


vs. Oryza sativa 


Os07g0585100 


76.5 


72.1 


0.349 


0 


0 


Blast 


Oryza sativa 


Os07g0585100 














vs. Homo sapiens 


CAB39 


47.0 


54.1 


0.711 


0 


0 




vs. Pan troglodytes 


CAB39 


47.0 


54.1 


0.711 


0 


0 




vs. Canis lupus familiaris 


CAB39 


47.0 


54.1 


0.711 


0 


0 




vs. Bos taurus 


CAB39 


47.0 


54.5 


0.701 


0 


0 
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. Mus musculus 


Cab39 


47.0 


53.9 


0.716 


0 


0 


Blast 


. Rattus norvegicus 


Cab39 


47.0 


54.2 


0.708 


0 


0 


Blast 


. Danio rerio 


cab39 


47.0 


53.7 


0.722 


0 


0 


Blast 


. Drosophila melanogaster 


Mo25 


46.2 


52.3 


0.758 


0 


0 


Blast 


. Anopheles gambiae 


AgaP_AGAP000812 


46.0 


51.6 


0.776 


0 


0 


Blast 


. Anopheles gambiae 


AgaP_AGAP0 1 1 060 


44.8 


50.8 


0.800 


0 


0 


Blast 


. Caenorhabditis elegans 


mop-25.1 


48.5 


52.7 


0.747 


0 


0 




. Caenorhabditis elegans 


mop-25.2 


45.5 


51.2 


0.789 


0 


0 




. Schizosaccharomyces pombe 


pmo25 


43.0 


51.4 


0.782 


0 


0 




. Arabidopsis thaliana 


AT4G17270 


70.8 


70.5 


0.374 


0 


0 


Blast 


. Arabidopsis thaliana 


AT5G47540 


76.5 


72.1 


0.349 


0 


0 





'e present th 



substitutions per site, as defined below: 



> d: the number of nucleotide substitutions per site, corrected for multiple substitutions using the method of Jukes and Cantor (1969). 

» d N /d s : the ratio of the rate of nonsynonymous substitutions (d N ) to the rate of synonymous substitutions(dg), calculated using the method of 

Nei and Gojobori (1986). A high value of this metric indicates adaptive selection, whereas a low value indicates purifying selection. 
• d NR /d NC : the ratio of radical nonsynonymous substitutions (d NR ) to conservative nonsynonymous substitutions (d NC ), calculated using the 
method of Hughes etai, (1990). This metric is analogous to d N /d s , but it has the advantage of being useful for studying the evolution of 
at diverged in the distant past. 



>r. Mammalian 



nn III. New York: Academic Press; 



• Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 
1986;3(5):418-26. 

> Hughes al, Ota T. Nei m. Positive Darwinian selection promotes charge profile diversity in the antigen-binding cleft of class I MHC molecules. Mol 

Bi ol Evol. 1990;7(6):515-24. 
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EXHIBIT D: 
COMPARISON OF RAT AND HUMAN LKB1 
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BLAST 



Basic Local Alignment Search Tool 



Edit and Resubmit Save Search Strategies Formatting options Download 
Blast 2 sequences 

SEQ ID NOjji 

Results for: | lcl|1 7777 SEQ ID NO: 6(43333)^ 

Your BLAST job specified more than one input sequence. This box lets you choose which input sequence to 
show BLAST results for. 



Query ID 

Id] 17777 

lcl|17777 
Description 

SEQ ID NO: 6 
Molecule type 

amino acid 
Query Length 

433 



Subject ID 

gi| 1 57820995|ref|NP_001 1 01 539. 1 1 
Description 

serine/threonine-protein kinase 11 [Rattus norvegicus] >gi|149034596|gb|EDL89333.1| serine/threonine 

kinase 1 1 (predicted), isoform CRA_a [Rattus norvegicus] 
Molecule type 

amino acid 
Subject Length 

436 
Program 

BLASTP 2.2.24+ Citation 



Reference 

Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, 
and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Reference - compositional score matrix adjustment 

Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. 
Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution 
matrices", FEBS J. 272:5101-5109. 

Other reports: Search Summary fTaxonomv reports! [Multiple alignment] 
Search Parameters 



Search parameter name Search parameter value 



Program blastp 

Query range 44-343 

Word size 3 

Expect value 10 

Hitlist size 100 

Gapcosts 11,1 

Matrix BLOSUM62 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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Low Complexity Filter Yes 

Filter string L; 

Genetic Code 1 

Window Size 40 

Threshold 1 1 

Composition-based stats 2 

Karlin-Altschul statistics 



Params Ungapped Gapped 

Lambda 0.320459 0.267 
K 0.139595 0.041 

H 0.429592 0.14 

Results Statistics 



Results Statistics parameter name Results Statistics parameter value 

Effective search space 1 1 0297 



Graphic Summary 

Distribution of Blast Hits on the Query Sequence 



LZL 



An overview of the database sequences aligned to the query sequence is shown. The score of each alignment 
is indicated by one of five different colors, which divides the range of scores into five groups. Multiple 
alignments on the same database sequence are connected by a striped line. Mousing over a hit sequence 
causes the definition and score to be shown in the window at the top, clicking on a hit sequence takes the user 
to the associated alignments. New: This graphic is an overview of database sequences aligned to the query 
sequence. Alignments are color-coded by score, within one of five score ranges. Multiple alignments on the 
same database sequence are connected by a dashed line. Mousing over an alignment shows the alignment 
definition and score in the box at the top. Clicking an alignment displays the alignment detail. 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 
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Dot Matrix View 

Plot of lcl|17777 vs gi|157820995|ref|NP_001 101539.1 1 1?l 

This dot matrix view shows regions of similarity based upon the BLAST results. The query sequence is 
represented on the X-axis and the numbers represent the bases/residues of the query. The subject is 
represented on the Y-axis and again the numbers represent the bases/residues of the subject. Alignments are 
shown in the plot as lines. Plus strand and protein matches are slanted from the bottom left to the upper right 
corner, minus strand matches are slanted from the upper left to the lower right. The number of lines shown in 
the plot is the same as the number of alignments found by BLAST. 

H 

Descriptions 

Score E 

Sequences producing significant alignments: (Bits) Value 

refjNP 0011Q1539.ll serine/threonine-protein kinase 11 [Rattu... 525 le-153 



Alignments 



Score = 525 bits (1351), Expect = 
Identities = 287/300 (96%), Posits 

KLIGKYLMGDLLGEGS YGKVKEVLDSETLCRRAVki Ikkkkl rr i PNGEANVKKE IQLLR 103 
KLIGKYLMGDLLGEGSYGKVKEVLDSETLCRPJWKILKKKKLRRIPNGEANVKKEIQLLR 
KLIGKYLMGDLLGEGS YGKVKEVLDSETLCRRAVKILKKKKLRRIPNGEANVKKEIQLLR 103 

RLRHKOTTQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYFCQLIDG 163 
RLRH+NVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYF QLIDG 
RLRHRNVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYFRQLIDG 163 

LEYLHSQGIVHKDIKPgnlllttggtlKISDLGVAEALHPFAADDTCRTSQGSPAFQPPE 223 
LEYLHSQGIVHKDIKPGNLLLTT GTLKI S DLGVAEALH PFA DDTCRTSQGSPAFQPPE 
LEYLHSQGIVHKDIKPGNLLLTTNGTLKISDLGVAEALHPFAVDDTCRTSQGSPAFQPPE 223 

IANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIGKGSYAIPGDCGPPLS 283 
IANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIG+G + IP DC PPLS 
IANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIGRGDFTIPCDCAPPLS 283 

DLLKGMLEYEPAKRFSIRQIRQHSWFRKKHppaeapvpippspDTKDRWRSMTVVPYLED 343 
DLL+GMLEYEPAKRFSIRQIRQHSWFRKKHP AEA VPIPPSPDTKDRWRSMTWPYLED 
DLLRGMLEYEPAKRFSIRQIRQHSWFRKKHPLAEALVPIPPSPDTKDRWRSMTWPYLED 343 



Query 


44 


Sbjct 


44 


Query 


104 


Sbjct 


104 


Query 


164 


Sbjct 


164 




224 


Sbjct 


224 


Query 


284 


Sbjct 


284 
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11/15/2010 



